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Abstract. A connected Lie group G is generated by its two 1-parametric subgroups 
exp(iA), exp(ty) if and only if tlie Lie algebra of G is generated by {A, Y}. We consider 
decompositions of elements of G into a product of such exponentials with times t > and 
CN ■ study the problem of minimizing the total time of the decompositions for a fixed element of 
^ . G. We solve this problem for the group SU2 and describe the structure of the time-optimal 
p\| i decompositions. 

0. Introduction. 

■ In the axiomatics of quantum mechanics a state of a quantum system is a unit vector in 

a complex hermitian vector space and its evolution is given by unitary transformations. In 
quantum computing the space of states is finite-dimensional and the quantum computation 
u: is an element of SUn, which is a compact connected real Lie group. In order to carry out 
a quantum computation in physical reality, we need to be able to transform an initial state 
1^ ! '0 of a quantum processor into a state gt/j, where g is the element of SUn representing the 
I quantum computation. In the classical (non-quantum) case, a computation can be viewed 
as a boolean function in several variables. A processor can not implement an arbitrary 
function directly, but instead decomposes the computation into elementary steps, which is 
reflected in the process of programming. An efficient algorithm for a classical computation 
Q ! is a program that minimizes the number of elementary steps, which results in a shorter 
run time. 

Likewise, an implementation of a quantum computation g is a factorization of g into 
certain elementary factors, called the quantum gates (in analogy with the classical logical 
O ' gates used for the decomposition of an arbitrary boolean function of several variables in 
^ ' a disjunctive normal form, for example). Some quantum gates are the direct analogues 
of the classical logical gates and are discrete, while others depend on a continuous time 
parameter. 

^ ■ Time evolution of a quantum system is governed by a Hamiltonian H and is given by 

. the exponential exp{itH) G SUn. Here X = iH belongs to the Lie algebra sun. We are 
going to assume that all available quantum gates are continuous, since the discrete gates 
are physically realized as continuous gates applied for a specific finite time. 

This leads to the following quantum control problem: given a, set S C sun of quantum 
controls (gates), decompose an element g G SUn into a product 

5f = exp(tiCi) X . . . X exp(t„Cn), (0.1) 

where Ci E S, ti E M. This quantum control problem admits a solution for every g 
precisely when the set S generates the Lie algebra sun (see Theorem 1.1 below). This 
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result is proved using topological methods and does not provide an effective procedure for 
finding such decompositions. 

Minimal sets of quantum gates for quantum computing were discussed in [2] and [6], 
where it is shown that a set of two generic Hamiltonians is sufficient for controllability. A 
review of quantum control methods in physical chemistry is given in [7]. 

The problem of finding explicit factorizations of type (0.1) goes back to Euler [3], 
who studied it for the group SOs of rotations in M*^. The Lie algebra sos consists of 
skew-symmetric 3x3 matrices and the exponentials exp(tCj) corresponding to 
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are the rotations in angle t around X- and Y-axis respectively. Euler proved that every 
g e SOz can be factored as ^ = exp(tiCi) exp(t2C'2) exp(t3Ci) for some ti,t2,^3 £ [0,27r). 
The parameters ti, t2, ^3 are called Euler angles. This problem has applications in robotics 
when an object needs to be oriented in space in a prescribed way. 

For the quantum applications the parameters tj in (0.1) represent time, and thus 
must be positive. It is natural to pose an optimization problem in this context: among 
all factorizations (0.1) with positive times U, find the one that minimizes the total time 
ti + . . . + tn- This corresponds to finding an implementation of a quantum algorithm with 
the shortest running time. 

As we shall see in this paper, this optimization problem, as posed, might not have a 
solution - there may be a sequence of factorizations of type (0.1) with the number of factors 
n going to infinity, while the total time going to infinum. For this reason we slightly modify 
the above optimization problem and ask for the infinum of total times for all factorizations 
of a given g G SUn 

In this paper we solve this problem for the group SU2 with the set of controls S 
consisting of two elements, S = {X,Y}. 

We show that the infinum time does not change if we replace 5" = {X, Y} with its 
convex closure S = {tX + {1 — t)Y \ < t < 1}. In the latter case there will be in fact 
an optimal decomposition (0.1) with a finite number of factors. It turns out that to get 
an optimal decomposition it is sufficient to add to S at most one element W E S, the one 
that is orthogonal to X — y. In this paper we give explicit descriptions of the optimal 
decompositions in SU2- Since there is a surjective homomorphism SU2 — > SOz, our results 
are also applicable to the group SOs. 

Our solution is based on the method of Lagrange multipliers adopted for the set-up 
of Lie groups. Alternatively one could use the geometric theory developed in [4], which 
is based on the Pontryagin maximum principle. The general methods, however, give only 
necessary conditions for optimality, which in practice are not sufficient. To get the desired 
results we supplement these general methods with some explicit calculations in SU2 which 
allow us to get stronger and more explicit optimality conditions. 

We hope that our methods and results will help to solve this problem in greater 
generality. 
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1. Time-optimal decompositions in Lie groups. 

Let G be a compact connected real Lie group, and let g be its Lie algebra. An 
element X E Q defines a 1-parametric subgroup {ex.p{tX)\t G M}. In this paper we study 
an optimal control problem on G, describing optimal decompositions of an arbitrary given 
element g & G into a product of exponentials exp(tA) with X belonging to a fixed set S 
of controls, S C g and positive times t. 

The following well-known criterion describes when the group G is controllable by a 
set S: 

Theorem 1.1. ([4], Theorem 6.1) A real connected Lie group G is generated by 
its subgroups {exp(tX)}, X e 5, if and only if S generates the Lie algebra Q. 

In Theorem 1.1 the parameters in the subgroups {exp(tX)} could be both positive and 
negative. In applications to controllability of quantum systems, the parameters t represent 
time, and must be positive. If we restrict the question to controllability with positive time 
parameters, then the analogue of Theorem 1.1 still holds for compact Lie groups: 

Theorem 1.2. ([4], Theorem 6.3) Let G be a compact connected real group. If 
the set S generates the Lie algebra of G then every element g E G admits a factorization 

g = exp(tiCi) • . . . • exp(tnCn) 

for some n > with Ci E S and ti > 0. 

These theorems are proved using topological methods, and do not provide an effective 
way of finding such decompositions. In this context it is natural to pose the problem of 
describing decompositions that are time-optimal: 

Problem. For a given g E G determine 

iiif {h + ... + tn\g = exp(tiCi) • . . . • exp(t^C„), ti>0,C^ES}. (1.1) 

A compact Lie group G is isomorphic to a Lie subgroup in a general linear group ( [5] , 
Corollary 4.22), so we assume that G C GLd(F), where F = M or C. The Lie algebra q is 
then a real subalgebra in the matrix Lie algebra Md(F). 

Consider an arbitrary M-bilinear positive-definite scalar product (•, •) on Mrf(F). We 
can make it left and right G-invariant by integration over G using the Haar measure on G 
([5], Section IV.2): 

{A,B)= J J {gAh,gBh)dgdh. 

GxG 

Since the Haar measure on G is left- and right-invariant, this averaging procedure will 
yield a G-bi-invariant scalar product (■ , ■) on Md(F) which is still positive-definite. This 
scalar product defines a norm | ■ | on Md(F). We can rescale the above scalar product to 
achieve 

\AB\ < \A\ ■ \B\ for aU A,B E Md{¥). 
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The restriction of this scalar product to Q gives a positive-definite bihnear form on q, which 
is invariant under the conjugation action of G on its Lie algebra: 



{gAg-\ gBg-^) = (A, B) for A,BeQ,geG. 

Note that the Killing form on q yields a proportional scalar product when q is simple, 
but in case of a general compact Lie group G the Killing form has a disadvantage that it 
is only (negative) semi-definite ([5], Corollary 4.26), and may be degenerate, whereas the 
form constructed above is strictly positive-definite. 

We get the following estimate on the norm of a Lie bracket: 

\[A,B]\ = \AB-BA\ < 2 - |y4| • \B\. 

The exponential map 



oo ^ 

exp : ^ G, exp{A) = J] w^'' 



kV 

k=0 



is surjective for compact G ([5], Corollary 4.48), but is not injective. However locally, it 
maps bijectively a neighbourhood of G to a neighbourhood of 1 G G. The logarithm, 
which is a local inverse of the exponential map and assumes values in g, is defined for 
g E G satisfying \1 — g\ < 1 using the standard formula 



oo ^ 

lii{g) = - ^ -B^, where S = 1 



The following Lemma is a straightforward norm estimation with power series of oper- 
ators, and we omit its proof: 

Lemma 1.3. (a) | exp(A)| < e'^l, 

(b) |exp(yl + S)-exp(^)| <el^l (el^^l - l), 

(c) |exp(^ + S) -exp(^)| < el^l|5| + o(|S|), as \B\ 0. 

Let us recall the Campbell-Hausdorff formula ([1], n.6.4): 

exp(^) exp(S) = exp I J] Hr,s{A, B) , (L2) 

where 



m>l ■rx+.-.Tm=T v=i * * 
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and Hq^q(A, B) = 0. Here for a monomial U — A^^B^^ . . . A^'^B'^'^, we denote by [U] the 
iterated commutator, defined inductively for C G {A, B} as [C] = C, [CU] = [C,[U]]. 
The iterated commutator [A'^'^B^'^ . . . A^'^B^'^] is zero unless either Sm = 1 or Sm = and 

The first few terms of this series are: 

HrAA B) = A + B+^[A,B] + ^[A, [A, B]] - [A, B]] + .... 

r,s>0 

Let us discuss convergence of the Campbell-Hausdorff series ^ Hr,s{A, B). Consider 

r,s>0 

a function in two variables 

/K^) = -^ln(2-e^"+^-). 
We can expand this function in a Taylor series 

f{u,v) = ^ r]r,su''v^, 

r,s>0 

which is absolutely convergent when \u\ + \v\ < 

By Lemma IL7.1 in [1], we can estimate the terms of the Campbell-Hausdorff series: 

\Hr,s{A^B)\<r^r,s\A\"W- 

Thus the Campbell-Hausdorff series is convergent in g and the equality (1.2) holds when 
1^1 + \B\ < In addition, a tail of the Campbell-Hausdorff series may be estimated 
with the corresponding tail of the scries /(|^|, \B\). 

We will also use the following well-known formula ([5], Proposition 1.93): 

exp{A)Bexp{-A) = exp(adA)S, 

where ad(A)S = [A,B]. 

Theorem 1.4. Optimization problem (1.1) in SU2 with 5" = {X, Y} is equivalent to 
the problem with the set of controls S = {tX + (1 — t)Y \t E [0, 1]}. 

Proof. We need to show that for every r G [0, 1] the infinum of time in (1.1) with 
S — {X, ¥} for g — exp(TX -|- (1 —t)Y) is less or equal to 1. It is well-known that g can be 
approximated with an arbitrary precision in time 1 (see e.g., Theorem 3.7 in [4]). Indeed, 
let A = tX, B = {1- t)Y. Then 



g = exp(^ + B) = lim 



A\ /S^^^ 



(1.3) 



To see this, we apply the Campbell-Hausdorff formula. 



from which (1.3) follows: 



A 



B 



N 



exp(^ + S + 0(l/iV)). 



We need to prove a stronger result, that g itself can be decomposed in a product of 
exponentials with control set S with a total time not exceeding 1 + e for an arbitrarily 
small e > 0. 

A direct computation with the Campbell-Hausdorff formula shows that we can improve 
the approximation (1.3) in the following way: 



h 



N 



exp I ^) exp (I) exp ( 



N 



exp(^ + S + C>(l/iV2)) 



(1.4) 



Let us show that we can attain exp {A + B) by a small variation of time parameters in 
(1.4), while keeping these parameters positive. Let i5(e) = {g & G\\1 — g\ < e}. Since 
the norm is G-invariant, we get that the ball in G of radius e with the center in /i G G 
can be written as hB{e) = B{e)h. We are going to show that by varying three of the 

parameters by , we can cover the set hisfB (^cN~^^, which will contain exp{A + B) for 

large N by Lemma 1.3(c). In order to show that a variation of certain three parameters 
covers a ball around Hn, we need to establish that the corresponding jacobian is non-zero. 
The computation of the jacobian takes place in the Lie algebra su^. Since {A, B} is a 
generating set for this Lie algebra, the set {A,B, [A,B]} forms a basis of SU2. The tangent 
vector to exp (t ■ ad{A + B)) B at t = is [A, B], which implies that for some M G N, the 
Lie algebra SU2 is spanned by {A, B, exp (^M~^ad{A + S)) i? j. Fix M, and assume that 
AT is a multiple of M, N = MK. We consider the following variation of (1.4): 



exp 1 ^ + eiA 



exp 



A 



B 



K-l 



A 



B 

X exp ( — + esB 



exp 



A 



B 



N-K-l 



exp 



A 
2N 



The differential of this variation is 



exp ( { eiA + e2B + 63 



B 



A 



K 



B 



B 



A 



-K' 



X 



B 



A 



N~l 



B 



exp 



A 
2N 



Since 



lim 

JV->-oo 



B 



exp I - 1 exp ^ 



A 



B 



B 



A 



-K 



exp (M-iad(A + S)) S, 
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we conclude that for large N the jacobian of this variation is non-zero. Thus by vary- 
ing the corresponding time parameters with |ei|, |e2|, jeal < N~2 ^ we can cover the set 

, for some constant c > that depends on A and B. Since approximates 

g = exp(A -I- B) with precision 0{N~'^), this set contains g for large N. This completes 
the proof of the Theorem. 

Since a uniform rescaling of the controls X' — cX, Y' = cY, gives an equivalent 
optimization problem (with optimal time rescaled by a factor c~^), we may assume without 
loss of generality that \X\ = 1 and |y| > Let n = l/\Y\ < 1. We may pass to the 
normalized set of controls {X, y/|y|} by replacing the total time ti + . . . + tn for the 
decomposition exp(tiCi) • . . . • exp(/;^C„) with the cost function 

^Kiti, where I ^'^ if ^cf^ y/[y|. ^^"^^ 

i=l ^ 

From now on we assume that 5" = {X, y } with \X\ = \Y\ = 1 and consider the 
optimization problem with cost function (1.5) where the cost factor k < 1. This will allow 
us to consider the limiting case k = 0, when there is no cost associated with control Y. 

Let us introduce some terminology. 

An admissible word of length n is an expression exp(tiCi) • . . . • exp(t„C^) with n > 0, 
ti >0 and Cj G 5'. A word of zero length is the identity clement of G. 

Every admissible word can be written in a reduced form, where ti > and Cj ^ Cj+i 
for all i. 

A decomposition of G G as an admissible word of length n is called n-optimal if it 
has the minimum cost among all admissible words of length n that are equal to g. 

A decomposition of ^ G G as an admissible word of length n is called optimal if it has 
the minimum cost among all admissible words of arbitrary lengths that are equal to g. 

For a given G G, an optimal decomposition may not exist since there might be a 
sequence of decompositions of g of increasing lengths and with cost going to infinum. On 
the other hand, n-optimal decompositions exist, as we show in the following Lemma. 

Lemma 1.5. Let G be a connected Lie group, and suppose that the set of controls 
S is finite and satisfies the following condition: every generator G G 5" with a zero cost 
factor n — Q has a periodic exponential, i.e., exp(TG) = 1 for some T > 0. If G G has a 
decomposition as an admissible word of length n then it has an n-optimal decomposition. 

Proof. Let tjnf be the infinum cost over the set of all decompositions of g in admissible 
words of length n. Let 

{exp(4^-^Gp^)-...-exp(4^)GC^))^^^^ 

be a sequence of admissible words of length n with cost converging to tinf- Since the set 
of controls S is finite, we can choose a subsequence in which the generators {G^"'''} are 
independent of j. 

All times t^^"^ are bounded by the same constant (for the generators with non-zero 
cost factors a bound is obtained from the bound on the total cost, and for other generators 
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from the periodicity assumption). Thus there is a subsequence where all times converge 
to some values, lim tl"''' = tk- Then by continuity, exp(iiCi) • . . . • exp(t„C„) = g and the 

cost of the word exp(tiCi) • . . . • exp(t„Cn) is t[^f. 
The following Lemma is obvious: 

Lemma 1.6. (a) If a word of length n is optimal then it is n-optimal. 

(b) If the word 

exp(tiCi) • . . . • exp(t„Cn) (1.6) 
is optimal (resp. n-optimal), then its subword 

exp(tpCp) • . . . • exp(tfcCfc) 

with l<7?<A;<nis also optimal (resp. k — p + 1-optimal). 

(c) If the word (1.6) is optimal then 

exp(siCi) exp(t2C2) • . . . • exp(t„_iCn-i) exp(snCn) 

with < si < ti, < Sn < tn, is also optimal. 

The last claim of the lemma suggests that for the optimal words there are stronger 
constraints on time parameters U with 2 < i < n — 1. We will call these the middle time 
parameters. 

2. Optimal words in SU2. 

For the rest of the paper we will focus on the case G = SU2. We find it convenient to 
use the realization of SU2 as the unit sphere in the quaternion algebra H: 

SU2 = {al + bi + cj + dk\a'^ + + + d'^ = 1} . 

The Lie algebra SU2 in this realization is the tangent space at 1 and has basis {i,j,k}. 
This basis is orthonormal relative to the invariant bilinear form. The norm in EI satisfies 
\xy \ = \x\ ■ \y\ and is S't/2-bi-invariant. 

The isomorphism with the standard matrix construction of SU2 is given by Pauli 
matrices: 

^^(0 -i)' ^'^(-1 J)' ^^(z o)- 

The group SU2 acts on its Lie algebra by the conjugation automorphisms. Since — / 
acts trivially, this action factors through SO2 = SU2/ {^I}, and is given by the natural 
action of SO3 on R^. 

The action of SO3 on the unit sphere is transitive, so without loss of generality we may 
assume that X = i, while Y = icosa + jsina, where a is the angle between the vectors 
X and y, < a < TT. This identification will allow us to carry out certain calculations in 
an explicit form. If C is an element of SU2 of norm 1 then exp(tC) = cos(t) + Csin(t). 

Since exp(7rX) = exp(7ry) = — 1 is a central element, we see that an n-optimal word 
satisfies the following 
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TT-Condition: at most one time parameter may be greater or equal to tt; without loss 
of generality we may assume that this parameter corresponds to the generator Y, as it has 
a lower cost, and is not a middle time parameter. 

We begin by describing 4-optimal words. 

Proposition 2.1. (a) Let 

g = exp(tiX) exp{t2Y) exp{t3X) exp{t4Y) (2.1) 
be a 4-optimal word with ^1,^2,^3 < tt. Then either 

tan(t2) _ « - cos(q!) , , 

tan(t3) 1 — kcos(q;) 

or (2.1) is not reduced. 

(b) The same condition holds for a 4-optimal word 

exp(t4F) exp^t^X) exp{t2Y) exp(tiX). 

Proof. We will apply a version of the Lagrange multipliers method. Consider an 
infinitesimal variation of (2.1): 

exp((ti + €i)X) exp((t2 + e2)Y) expats + €3)^) exp((t4 + €4)!"), (2.3) 

with the constraint that the product still equals g. Carrying out calculations to the first 
order in e^, we have exp(eiCi) 1 + ejC^. We will collect all terms with in the middle 
of the word (2.3), using the relations: 

eiXexp{t2Y) = exp{t2Y) exp{-t2Y)eiX exp{t2Y) 

= exp{t2Y) [(cos(t2) - sin(t2)l')eiX(cos(t2) + sm{t2)Y)] 
= exp{t2Y)ei [cos^{t2)X - sin\t2)YXY + sin{t2) cos{t2)[X,Y]\ . 

Similarly, 

exp(t3X)e4y = €4 [cos ^{ts)Y - sm\ts)XYX + sin(t3) cos(t3)[X, Y]] exp{t3X). 

Setting Z = F] = sin(Q!)A;, we can get the following expressions in the basis 
{X,Y,Z}: 

XYX = F - 2 cos(a)X, YXY = X -2 cos{a)Y. (2.4) 
Using the above relations, we see that to the first order in ej, (2.3) can be written as 

exp(tiX) exp(t2F) [(ei cos(2t2) + £3 + 2e4 cos(q;) sm^{ts))X 

+ (2ei cos(q!) sin^(t2) + 62 + 64 cos(2t3))y + (ei sin(2t2) + 64 sin(2t3))Z] exp(tsX) exp(t4y). 
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Equating the middle factor to zero, we get the system: 

ei 003(2^2) + £3 + 2e4 cos(a) sin^(t3) = 0, 

2ei cos(ct) sin^(t2) + 62 + 64 003(2^3) = 0, (2-5) 
61 sin(2t2) + 64 sin(2t3) = 0. 

If this system has rank 3 then by the imphcit function theorem there exists a smooth curve 
in the space of parameters {(^1,^2,^3,^4)} such that the product (2.1) is equal identically 
to g on the curve. The tangent vector to this curve is a non-zero solution of the system 
(2.5). Since by assumption the word (2.1) is 4-optimal, the differential of the cost function 

61 + K62 + 63 + K64 

must be zero on the tangent vector of the curve. Thus the determinant 

cos(2t2) 1 2cos(a)sin^(t3) 
2cos(a)sin2(t2) 1 cos(2t3) 
sin(2t2) sin(2t3) 

1 K 1 K 

must be zero, since the corresponding homogeneous system of equations has a non-trivial 
solution. In case when the system (2.5) has rank less than 3, the above determinant is still 
equal to zero. Evaluating this determinant we get: 

4sin(t2) sin(t3) ((1 — kcos(q;)) sin(t2) cos(t3) — (k — cos(a)) cos(t2) sin(t3)) = 0. 

Since < ^2,^3 < ti", we can cancel the factor 4sin(t2) sin(t3) and obtain the claim of the 
proposition. 

Part (b) is completely analogous. 

Remark 2.2. The denominator of /^~'^°^(") ig non-zero since < k < 1 and 

1— /tcos(a) — — 

|cos(q;)| < 1. If the numerator of this fraction vanishes, the condition (2.2) should be 
replaced with 

(1 — kcos(q!)) sin(t2) cos(t3) = (k — cos(q;)) cos(t2) sin(t3). (2.6) 

Using Proposition 2.1 and Lemma 1.6 we get a description of n-optimal words: 
Corollary 2.3. Suppose n ^ cos(q;). Let exp(tiCi) exp(t2C'2) ■ . . . ■ exp(i„Cn) be a 
reduced n-optimal word with n > 4. Then 

tp = tp+2 for aU 2<p<n- 3. (2.7) 

Since in an n-optimal word all middle parameters corresponding to the same control 
are equal, we will denote by tx (resp. ty) the middle time parameters corresponding to X 
(resp. Y). 
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Corollary 2.4. Under the assumptions of the previous corollary, 

tan(tj,) K — cos(a) 
tan(ta;) 1 — kcos(q:) 



(2.8) 



We see from Corollaries 2.3 and 2.4 that reduced n-optimal words are described with 
at most three independent time parameters for all n. Since the group SU2 is three- 
dimensional, we conclude that for each n there exists only a finite number of n-optimal 
words representing a given g G SU2 (for the case n — cos(q;) see Theorem 2.12 below). 

Next we shall investigate optimality of words of length 3. 

Proposition 2.5. Let cos(t) > and let e > be a small parameter. Then 

{i) exp(eX) exp(tF) exp(eX) = exp(Tr) exp(//X) exp(TF) (2.9) 

and 

{a) exp(ey) exp(tX) exp(ey) = exp(TX) exp(//y) exp(TX), (2.10) 

where 

T = t/2 + ecos(a)(l - cos(t)) + o(e) (2.11) 

and 

/i = 2ecos(t) +o(e). (2.12) 
Proof. Let us write both sides of (i) in the form a + bX + cY + dZ, using the relations 

XY=- cos(a) + Z, YX = - cos(a) - Z, (2.13) 

together with (2.4). We immediately see that d = 0, while 

a = cos(2e) cos(t) — cos(q!) sin(2e) sin(t) = cos(2t) cos(//) — cos(q;) sin(2T) sin(//), (2.14) 

b = sin(2e) cos(i) — 2cos(q;) sin^(e) sin(t) = sin((U), (2-15) 

c = sm(t) — sin(2T) cos(//) — 2 cos(q;) sin^(T) sin(//). (2-16) 

We can determine fj, from (2.15), getting (2.12). 
From the equality 

(c + 6cos(q;)) cos(//) — a cos(q;) sin(//) = sin(2r)(l — sin^(Q!) sin^(/i)) 

we determine r and obtain (2.11). It can be seen that this indeed gives a solution of (i). 
The condition cos(t) > is required to ensure the positivity of fx. The claim for part (ii) 
follows by symmetry. 

Corollary 2.6. Let < t < ^ and let e > be a small parameter. Then 

(i) The word exp(eX) exp(ty) exp(eX) is not 3-optimal. 

(ii) If K > cos(q;) then the word exp(ey) exp(tX) exp(ey) is not 3-optimal. 
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Proof. We are going to show that the cost of the right hand sides of (2.9) and (2.10) 
are smaUer than for the left hand sides. Let us evaluate the differences in these costs: 

2e + K,t — 2k,t — h = 2e + nt — Kt — 2Ke cos(q;)(1 — cos(t)) — 2e cos(t) + o(e) 

= 2e(l - cos(t))(l - kcos(q;)) + o(e) > 0, 
which proves (i), while for (ii) we have: 

t + 2Ke — 2t — Kfj, = t + 2Ke — t — 2ecos(Q;)(l — cos(t)) — 2Kecos(t) + o(e) 

= 2e(l - cos(t))(K - cos(q!)) + o(e) > 0. 

Corollary 2.7. Suppose n > cos(q;). A reduced optimal word exp(tiCi) . . . exp(t„C„) 
of length n > 1 satisfies the strong tt -condition, which is the 7r-condition with an additional 
restriction ti,tn < tt. 

Proof. It is sufficient to show that for a reduced optimal word exp{tiY) exp(t2X) of 
length 2, we have ti < tt. Suppose ti > tt and let < r < min(t2, f )• Then 

exp(tiF) exp(t2-^) = exp((ti — 7r)F) exp(TX) exp(7rF) exp((t2 — 

which is not optimal by Corollary 2.6(ii). This contradiction implies ti < tt. 

Now we can describe possible optimal decompositions in SU2. We will consider several 
cases. 

Theorem 2.8. Suppose cos(q!) < k < 1. Then the infinum of the cost of admissible 
decompositions of a given element of SU2 is attained either 

(a) on a reduced n-optimal word 

exp(tiCi) exp(t2C2) • . . . • exp(t„C„) 

satisfying the strong 7r-condition, (2.7) with ^ < tx < tt, ^ < ty < tt, and when n > 4 the 

condition (2.8). 

or 

(b) on a word 

exp(tiCi) exp{t2W) exp(t3C3), 
where Ci, C3 G {X, Y}, ti,t2, ts > and 

W =(1- KCOs(a))X +(k- cos(Q!))y 

with the cost of ex.p{t2W) equal to (k^ — 2kcos(q;) + 1)^2- 

Proof. It is clear that the infinum of the cost is either attained on an n-optimal word, 
or is a limit of costs for a sequence of reduced words of length k which are A;-optimal with 
k — >■ 00. 
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In the first case, the n-optimal word must satisfy the conditions in (a) above by- 
Corollaries 2.3, 2.4, 2.6 and 2.7. 

Alternatively, if for g G SU2 the infinum of the cost is the limit of costs for a sequence 
of /c-optimal words of increasing length, 

g = exp(4^)cp')) exp(4^-)c^^-)) • . . . • exp(tif C^f ), j = 1, 2, 3 . . . , (2.17) 

there will be a subsequence with the fixed first and last generators, Cp'' = C ,c\^^ = C" . 
Since all time parameters belong to the compact set [0,27r], there is a subsequence where 
{t^i^} and {^fc'^?} converge. 



lim 4-^^ = t\ fim 4''.^ = t"- 

Time parameters in /c-optimal words satisfy the conditions of Corollaries 2.3 and 2.4. 
Denote by tx^ the middle X-times and by ty^ the middle Y-times in (2.17). Then 

g = lim exp(t'C") fexp(t(/)X) exp(t(/V)') ^ ' ^ exp(t"C")- 
Since the total cost is bounded, we can choose a subsequence for which the sequence 



tx"* converges, and let 



lim 



kj 



By CoroUary 2.3 

tan(4''^) , K — cos(q;) 



= A, A = 



tan(4-''^) ' 1-kcos(q!)' 



Aj) _ 

Ly — 



and using the fact that arctan(Atan(t)) = At + o{t) as t ^ 0, we get that lim 

J-)- 00 

Xtx ■ 

By (1.3) we get that 

g = exp(t'C") exp(t^(X + XY)) exp(t"C"). 

Finally, we note that tx{X + XY) — iW , where W = {1 — Kcos{a))X + {k — cos(q;))F and 
i— tx/{i — kcos(q;)). 

Remark 2.9. The vector W is orthogonal to the line passing through X and Y/n. 

Remark 2.10. Time-optimal decompositions that appear in Theorem 2.8 involve at 
most three independent time parameters. Since the group SU2 is 3-dimensional, there will 
be a finite number of such decompositions of each length. Moreover, since middle times in 
the decomposition (a) above are at least ^, any given decomposition of g gives a bound 
on the length of an optimal decomposition of type (a) . 
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Theorem 2.11. Suppose < k < cos (a). Then the infinum of the cost of admissible 
decompositions of a given element of SU2 is attained on a reduced n-optimal word 

exp(tiCi) exp(t2C2) • . . . • exp(t„Cn) 

satisfying the 7r-condition, (2.7) with ^ < ty < tt, and when n > 4 the conditions < 
f and (2.8). 

Proof. In this case the infinum of the cost can not be realized as a limit cost of a 
sequence of n-optimal words of increasing lengths since by Corollary 2.4 in an n-optimal 
word we have either > ^ or ty > ^, which implies that the cost would go to infinity as 
n — >■ 00. 

Applying Proposition 2.1 and Corollary 2.6, we see that for a reduced optimal word 
of length n the claim of the theorem holds. 

Theorem 2.12. Suppose n = cos(q!) > 0. Then the infinum of the cost of admissible 
decompositions of a given element of SU2 is attained on a reduced n-optimal word 

exp(tiCi) exp(t2C2) • • • • • exp(tnCn) 

satisfying the 7r-condition, (2.7) with f < < tt, and when n > 4 the condition tx = ^■ 

Proof. Applying Proposition 2.1 (see also Remark 2.2), n-optimal words with n > 4 
have middle X-timcs equal to ^. This implies that the infinum of cost is attained on an 
n-optimal word and is not a limit for a sequence of n-optimal words of increasing length. 
It remains to show that the middle F-times are all equal to each other. This can be 
done using the same method as in the proof of Lemma 2.6, by considering a the following 
variation of a 5-optimal word: 

exp((ti + ei)X) exp((t2 + e2)Y) exp exp((t3 + €3)!") exp((t4 + €4)^). 

Evaluating the resulting 4x4 determinant, we get 

8cos(q!) sin^(Q;) sin(t2) sin(t3) sin(t2 — ^3) = 0, 

which implies t2 = t^. 

Finally let us consider the case when the cost associated with the generator Y is zero. 
In this case we have the freedom of replacing the generator Y with —Y since exp{—tY) = 
exp((27r — t)y). Thus without loss of generality we may assume that cos(q!) < 0. The case 
when cos(q;) < is then covered by Theorem 2.8. 

Theorem 2.13. Let cos(q;) = and k = 0. For any g e SU2 the infinum cost is 
attained on a word of length at most 3. 

Proof. By Proposition 2.1, in an n-optimal word of length n > 4, the middle X-times 
are equal to ^. Thus the infinum cost is attained on an n-optimal word for some n. Let for 
a given g, n be the smallest length such that the infinum cost is attained on an n-optimal 
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word. Suppose the optimal word has a middle factor of exp{^X). Applying the following 
identity 

exp(tiF) exp exp{t2Y) = exp((ti - t2)Y) exp = exp exp((t2 - ti)Y), 

we see that such a word is not optimal. This implies n < 3. 
In conclusion, let us consider a particular case. 

Example 2.14. Let X = i, Y = j and k = 1. Then an optimal decomposition of any 
g E SU2 is given by words of the following types: 

(a) exp(tiCi), where < ti < 27r; 
(b) exp(tiCi)exp(t2C'2), where 0<ti,t2<7r; 

TT 

(c) exp(tiCi)exp(t2C'2)exp(t3Ci), where t2 > -, ^1,^2,^3 < tt; 



TV 

(d) exp(tiCi) exp(t2C2) exp(t2Ci) exp(t3C2), where t2 > —, ti,t2,t3 < tt; 



or 



(e) exp(tiCi) exp I t2^^ ) exp(t3C3), 



2 

where Cj e {X, F}, C2-, with the infinum time tfc. Here exp (i^^) may be viewed 

as 

TV 



exp ( t — - — 1 = lim 

^ * 2 / N^oo 



ti \ / tj 



expi^ exp 



We obtain this result by applying Theorem 2.8. The only thing we need to show is 
that there are no optimal words of length n > 5. Suppose such a word is indeed optimal. 
Then it is also n-optimal and by Theorem 2.8, all middle times satisfy < tx = ty < tt. 
However, 

exp {^i^ exp{tyj) exp (^^i^ = exp(7ri) exp{-tyj) = exp(7rj) exp{-tyj) = exp((7r - ty)j), 

which has a lower total time. The analogous equality holds if i and j are switched. This 
shows that if the infinum time is attained on a word of length n then n < 4. 

It is fairly straightforward to write down explicit formulas for ti, ^2)^3 in (a)-(e) above 
for a given g e SU2- 
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